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The Problem of Compatible Representatives 

by Donald E. Knuth and Arvind Raghunathan 

Many combinatorial tasks can be formulated in the following way: Is there a sequence 
(xi, X2-, ■ ■ ■ , x n ) such that Xj G Aj for all j, and Xj is compatible with Xk for all j < kl Here 
A\ , Ai , . . . , ^4 n are given sets, and "compatibility" is a given relation on A\ U ^2 U . . . U A n . 

This problem is NP-hard in general. For example, if all sets Aj are the same, and if 
compatibility is a symmetric, irreflexive relation, a sequence of compatible representatives 
is nothing but an n-clique in the compatibility graph. 

The problem of coloring a graph G with c colors is another NP-hard special case of 
the general compatibility question. Let Aj be the the set of pairs {(j, 1), . . . , (j, c)}, and 
say that (j, a) is compatible with (k, b) if either a 7^ b or Vj is not adjacent to Vk in G, 
where the vertices of G are . . . , v n }. Then a sequence of compatible representatives is 
essentially a c-coloring of G. Therefore the problem is NP-hard for all c > 3. 

On the other hand, the compatibility problem also has important special cases that are 
efficiently solvable. If the compatibility relation is '7^', then a solution sequence {x\ , . . . , x n ) 
is traditionally called a system of distinct representatives [4] [3, Chapter 5], and the problem 
of finding such systems is well known to be equivalent to bipartite matching. Indeed, 
if the compatibility relation is the complement of any equivalence relation, a sequence 
{x\,X2, ■ ■ ■ ,x n ) of compatible representatives exists if and only if there is a matching of 
cardinality n in a bipartite graph on the vertices {y 1, . . . , v n , c±, . . . , c m }, where {c±, . . . , c m } 
are the equivalence classes and we have Vj — Ck if and only if Aj contains an element of 
class Ck- 

Another nice special case is equivalent to identifying increasing subsequences of a 
permutation. Let TVi...n m be a permutation of {l,...,m}, and let Aj be the set of 
pairs {(j, 1), . . . , (j, m)}. Say that (j, a) is compatible with (k, b) if and only if j < k and 
tv a < tvi,. Then a compatible sequence ((1, a\ ),..., (n, a n )) is equivalent to an increasing 
subsequence (7r ai , . . . , 7r a J of tt x . . . 7r m . 

The example in the prevous paragraph illustrates that compatibility need not be a 
symmetric relation. But when the sets Aj are pairwise disjoint, as in that case, we could 
just as well assume that compatibility is symmetric and reflexive, since our definition of 
compatible representatives makes it immaterial whether elements Xj of Aj and Xk of Ak 
are compatible unless we have j < k. 

There are, however, important special cases in which compatibility is asymmetric. 
Consider, for example, a scheduling problem in which Aj is a set of tasks that can be done 
at time j, and where Xj is compatible with Xk only when task Xj does not require the prior 
completion of Xk- 

Cartographers face an interesting case of the general compatibility problem when they 
attach alphabetic labels to dots on a map. Let Aj represent the possible ways to place the 
name of city j, and let Xj be compatible with Xk when positions Xj and Xk do not overlap 
each other or otherwise mislead a potential reader. Then a good map should be a solution 
to the problem of compatible representatives. 

Notice that the cartographic problem makes sense even if the sets Aj are infinite. The 
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task of placing disjoint labels is a fairly natural question of combinatorial geometry that 
does not appear to be a special case of any other well known problem. 

In light of this discussion, it seems worthwhile to add the problem of compatible 
representatives to the class of "combinatorial problems that deserve a name," and to 
investigate heuristics and additional special cases that turn out to have efficient solutions. 

Simple special cases. We have noted that the compatibility problem is equivalent to 
bipartite matching when incompatibility is an equivalence relation. The problem also has 
a polynomial-time solution when compatibility is transitive. Let B\ = A\, and for j > 1 
let 

Bj = { y e Aj | 3 x e Bj-i (x compatible with y) } . 

Then the compatibility problem has a solution if and only if B n is nonempty. We can 
decide this in at most X1J=2 II A? -ill 1 1 A? 1 1 steps. 

Another noteworthy special case occurs when each set Aj contains at most two el- 
ements. Then the compatibility problem is equivalent to an instance of 2SAT: We can 
assume that Aj = {vj,Vj}; the clauses are (Wj VfJ for every pair of literals such that 
j < k and aj is incompatible with ak- 
in general, if each \\Aj\\ < k and k > 2, the problem reduces directly to an instance 
of ic SAT in which each literal occurs positively just once. The literals are (j, a) for a E Aj, 
and the clauses are 

V (j, a), for 1 < j < n ; 

neAj 

(j, a) V (k,b) , for 1 < j < k < n and a incompatible with b. 

Conversely, any instance of kSAT with m clauses reduces to the compatibility problem 
of finding representatives (x±, . . . , x m ), with xj a member of the jth clause and with two 
literals compatible iff they aren't negatives of each other. 

The general compatibility problem with finite sets Aj can also be reduced to an 
independent set problem in a natural way. Consider the graph G with vertices (j, a) for 
a E Aj, having edges 

0, a) — (j, b) , if a ^ £>; 

(j, a) — (k, b) , if j < k and a is incompatible with b. 

Then G has an independent set of size n if and only if the compatibility problem has a 
solution. 

Therefore we obtain simple solutions of the compatibility problem when there is a 
simple solution to the corresponding independent set problem. One such case occurs when 
compatibility is a symmetric relation and we have the following condition: If i < j < k and 
the elements a^, aj, are mutually compatible, then (1) every element of Ai is compatible 
with either aj or (2) every element of Aj is compatible with either ai or a^; (3) every 
element of Ak is compatible with either ai or a—j; and (4) every element not in AiUAjUA^ 
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is compatible with either Xi, Xj, or xj.. In such a case the graph G is claw-free, and we 
can use Minty's algorithm [7] to find a maximum independent set. 

Grotschel, Lovasz, and Schrijver [2, Chapter 9] have compiled a survey of cases where 
the independent set problem is known to have a simple solution. 

Another hard case. A very special case of the general mapmaker's problem, alluded to 
in the introduction, turns out to be NP-complete. 

Consider a set of integer points p\ , . . . , p n on the plane. We wish to find integer points 
xi, . . . ,x n with the following properties for all j ^ k: 



\ x j Pj\ ~ 1 



\xj -p k \ > 1 ; 



\Xj x I 2 



(Motivation: Each xj is the center of a 2 x 2 square in which a "label" for point pj can 
be placed. The label at xj should be closer to pj than to any other point; distinct labels 
should not overlap.) We will call this the MFL problem, for "METRFONT labeling," because 
it arises in connection with the task of attaching labels to points in diagrams drawn by 
METRFONT [5, page 328]. 

Solutions to the MFL problem can conveniently be represented by showing each point pj 
as a heavy dot and drawing an arrow from pj to xj for each j; at most four possibilities 
exist from each of the given points. For example, it is easy to see that a cluster of four 
adjacent points can be labeled in only two ways: 



There is no way to attach a label to the middle point in a configuration like 



because each of the four positions adjacent to that point is too close to one of the other 
given points. The MFL problem provides an amusing pastime for people who are sitting in 
a boring meeting and who happen to have a tablet of graph paper to doodle on. 

The general MFL problem is clearly in NP. In order to show that it is NP-complete, 
we observe first that there are only two solutions to the problem 



namely the two solutions for four-point clusters given earlier, using the same orientation 
in each cluster. Thus we can construct large chainlike tree networks of four-point clusters, 
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for example, 



in which there are only two solutions, "positive" and "negative." This construction pro- 
vides a way to represent the values of boolean variables in a satisfiability problem. 

We can now use Lichtenstein's theorem that planar 3SAT is NP-complete [6]. An 
instance of planar 3SAT is a set of variables v\ , . . . , v n arranged in a straight line, together 
with a set of three-legged clauses above and below them, where the clauses are properly 
nested so that none of the legs between clauses and variables cross each other. We can 
always put the clauses into a rectilinear configuration such as 



Vl 



V2 



V3 



V 4 



V5 



V6 



V 7 



V8 



V9 



which corresponds to Lichtenstein's "crossover box" [6, Figure 5]. 

We construct an instance of MFL from a given instance of planar 3SAT by representing 
the vertical legs for each variable as chains of four-point clusters; this guarantees that each 
variable will have one of two values, corresponding to the common orientation of all its 



clusters. We can easily stretch out the diagram so that there is no interference between 
the variables except at places where three legs of a clause come together in a horizontal 
segment. 

It remains to specify the representation of the clauses. By symmetry we need only 
describe the representation that appears above the variables. Each horizontal section of a 
comb-like clause in the upper portion will be represented by a configuration of the form 

left arm right arm 



with 6/ + 4 dots in the left arm and 6m + 4 dots in the right arm, for some / and m. (The 
three triples at the bottom will connect to clusters that represent variables, as explained 
below. Those clusters will occur at positions that are congruent mod 6; the arms of a comb 
stretch out so that they reach the variables appropriate to the clause.) 

In each group of three dots at the bottom of this construction, the arrow for the middle 
dot must go either up or down. All three middle arrows cannot go up, because that forces 




and there is no way to attach an arrow to the middle dot in the second row. 

However, there are solutions in which any one of the middle arrows goes down. For 
example, we can choose 
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or 



and there is a third solution that is essentially a mirror image of the first. 

We can place four-point clusters below a row of three dots in such a way that a 
downward arrow on the top middle dot forces an orientation on the clusters, but an upward 
arrow on the top middle dot forces nothing: 



By choosing one of these junction configurations for each variable in the clause, depending 
on whether the variable is negated or not, we obtain an instance of MFL that has a solution 
if and only if the given planar clauses are satisfiable. 

Backtracking. We have now proved that MFL is NP-hard. However, in practice a solution 
or proof of nonexistence can often be found quickly by backtracking, using the idea of 
"preclusion" introduced by Golomb and Baumert [1]. When a trial value xj is selected 
from Aj, it precludes all selections of other Xk that are incompatible with it; precluded 
values can be (temporarily) removed from Ak- The problem of compatible representatives 
is precisely the abstract general setting that supports this notion of preclusion. 

Golomb and Baumert suggest choosing Xj at each stage from a currently smallest 
set Aj whose representative has not yet been chosen. If we are simply looking for a solution, 
instead of enumerating all solutions, it would also be worthwhile to select elements that 
preclude as few others as possible. 

For example, if an element of Aj doesn't preclude any others, we can set Xj equal to 
that element without loss of generality. If x G Aj precludes only one element y G Ak and 
no others, and if we find no solution when Xj = x, then we can set xu = y without loss of 
generality. 

Further work. A recent paper by Simon [8] considers the assignment of channels to trans- 
mitters in a radio communication system. This is another case of a compatibility problem, 
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rather like the map maker's problem because nearby transmitters must not broadcast on 
the same channel. Simon presents a polynomial-time approximation scheme that is guar- 
anteed to find at least a fixed fraction of the optimum number of compatible channels. 
This suggests that many useful approximation schemes for other instances of the general 
compatibility problem might remain to be found. 
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